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Abstract —New bounds on the cardinality of permutation 
codes equipped with the Ulam distance are presented. First, an 
integer-programming upper bound is derived, which improves 
on the Singleton-type upper bound in the literature for some 
lengths. Second, several probabilistic lower bounds are developed, 
which improve on the known lower bounds for large minimum 
distances. The results of a computer search for permutation codes 
are also presented. 

Index Terms —Permutation codes, rank modulation, Singleton 
bound, sphere-packing bound, Ulam distance. 

I. Introduction 

A permutation code is a subset of the symmetric group 
equipped with a distance metric. Permutation codes are of 
potential use in various applications, such as communications 
over Gaussian channels ca, Go), power-line communica¬ 
tions 0, m, and coding for flash memories used with rank 
modulation 0. Permutation codes were extensively studied in 
the literature over the last decades. In most of these studies, 
permutation codes are equipped with the Hamming and the 
Kendall r metric ll5l. M. ifT^. 

Permutation codes were recently proposed for storing infor¬ 
mation in non-volatile (flash) memories under rank modula¬ 
tion a, nni, im, na. The main idea of the rank modulation 
scheme is that the information is stored in the form of rankings 
of the cell charges, rather than in terms of the absolute 
values of the charges. Rank-modulation codes represent a 
family of codes capable of handling errors of the form of 
adjacent transpositions m, ESI or translocations jg. Such 
error patterns are typical for memory systems, where leakage 
of electric charge occurs over time. 

There are two types of errors where a permutation code 
equipped with the Ulam or Kendall r metric could be of use. 
One such type is overshoot errors, in which a cell receives 
more charge than it is supposed to. The second type is the 
errors, in which a defective cell loses charge more quickly than 
normal. Both of these types of errors constitute one error in the 
Ulam metric or a number of errors in the Kendall r metric. 
Thus, codes in the Ulam or Kendall r metrics seem to be 
appropriate for error detection and correction in the paradigm 
of rank modulation. 

The problem of estimating the maximum size of a code in 
the Ulam metric for given parameters is very difficult. Dif¬ 
ferent mathematical tools could be applied to this problem. In 
this work, we demonstrate that novel bounds on the maximum 
size of a code can be obtained by an integer-programming 
method and by probability estimation techniques. These two 
approaches deal with different regimes: the probability bounds 


are useful for large n (some results, like Proposition IIV.3I are 
of the asymptotic nature only), whilst the integer-programming 
approach works well with (relatively) small n. 

II. Notation 

Denote by Zq the set of non-negative integers. We also use 
the notation [n] = {1, 2 , • • • , n}. 

A permutation cr : [n] —>■ [n] is a bijection. Let §„ 

denote the set of all permutations of the set [n], i.e., the 
symmetric group of order n\. For any cr G we write 
(j = [cr(l), (t(2), • • • , fT(n)], where cr(i) is the image of i G [n] 
under the permutation cr. This is called the one-line notation 
of permutation a. The identity permutation [1,2, ••• ,n] is 
denoted by e, while stands for the inverse of the per¬ 
mutation cr. 

Let d : S„ x S„ —>• be a metric defined for pairs of 

permutations. A permutation code of length n and minimum 
distance d in a metric d is a subset C of S„, such that for all 
T^a ^ C, T ^ a, SN& have d(r, a) > d. Such a code will be 
also called an (n, d) code in a metric d. 

Definition II.l. Assume that l<i<j<n. A permutation 
T G E>n is a right translocation if 

r = [I,-- - l,i-f l,(-f 2,-- - -{-!,■ ■■ ,n] . 

A permutation t G Sn is a left translocation if 

T= [I,-- - I,-- - I,-- - ,n] . 

Next, we define the composition of two permutations. 

Definition II.2. Let r and a be two permutations in S„. 
Then, their composition ra is a permutation in S„ defined as 

Vi G [n] : (Ta)(i) = T(cr(()) . 

Under composition of permutations, §„ forms a group, 
called the symmetric group of order n. 

Definition II.3. The Ulam distance dj/(cr, p) is the smallest 
integer m such that there exists a sequence of (right and left) 
translocations ..., Tm, such that p = (7TiT 2 ■ ■ ■ Tm- 

Definition II.4. A subsequence of length m of a = 
[cr(l),..., cr(n)] is a sequence of the form [cr(ii),.. ., 
where U < U < ■ • ■ *m- Tef r, cr G S„. The longest common 
subsequence of t and a is a subsequence of both t and cr of 
the longest possible length. 

We denote the length of a longest common subsequence of 
r and cr by L(r, cr). Similarly, L(cr) = L(cr, e), the length of 




a longest increasing subsequence of r. It is well known ii 
that for any cr € S„, 

d( 7 (r,cr) = n - L(r,cr) . (1) 

III. Integer-programming bound for the Ulam 

METRIC 

A. Known bounds 


and an objective function 

n 

a—1 

By combining this, we obtain the following linear program 
in Figure [U where its maximum provides an upper bound on 
A(n, d). 


Denote by A{n, d) the maximum size of a code over 
equipped with the Ulam metric. The following theorem 
provides bounds on A{n,d) 191 . 

Proposition III.l. For all n,d & ^o" with n> d> 1, 

(ji — d iV 

^ <A{n,d)<{n-d+l)l. (2) 

W-lj 


The right-hand side of (|2]i will be referred to as the Singleton 
bound in the sequel. 


max 

s.t. 


Va G G [n — d + 1] 



■ 2fb,a < 

n 


(d-1)! 


V6 G [n— 1] 
Va, b G [n] : 




'^^Xb+l,a 

a—1 


Xb,a > 0 


Fig. I. General integer-programming bound. 


B. Integer-programming bound 

In this section, we derive an integer-programming upper 
bound on A(n, d). 

Let C C S„ be a permutation code of Ulam distance d. It 
follows from O, that any subsequence of length n — d 1 
appears at most once in any codeword of C (in other words, 
any two codewords in C cannot have the same subsequence of 
length n — d-\-l or more). We use this fact in order to define 
integer variables Xb^a for all 1 < a < n, 1 < 6 < n. More 
specifically, 

Xb,a = |{cr G C : a{b) = a}| . 


In other words, Xb,a counts a number of codewords with a in 
position b. 

Assume that a G C, such that a(b) = a. Then, the number 
of different subsequences of a of length n — d-\-l of the form 
a, •, • • • , •), where a{b) = a and 1 < £ < n, is 


£ 

given by 


n—d—l 


6-1 


n — b \ 

-d-i) ■ 


On the other hand, there are different sequences of 

the form a, •, • • • , •). 



By a simple counting argument, we obtain that for all a G 



■ Xb.a < 


(n — 1)! 

Jd^.- 


(3) 


The total number of the codewords can be obtained, for 
example, by Xb,a, for any b G [n]. Therefore, we add 

constraints 


Next, observe that Xb^a should be an integer. Therefore, we 
are interested in an integral solution to this linear-programming 
problem. This provides a tighter upper bound than the frac¬ 
tional solution to the same LP problem. 

Example III.l. Take n = 5 and d = 3. The corresponding 
integer linear-programming problem is shown in Figure |2] 


max X;Li ^i.a 
s.t. Va G [5] : 

1- (2) + Q + Q •2f3.a<12 

1 • 3 • X2.a + 2 • 2 • ATg.a + 3 • 1 • X4.a < 12 

5 5 

V6G[4]: 

Va, 6 G [5] : X^a > 0 


Fig. 2. Integer program for n = 5 and d = 3. 

After simplification, this integer-programming problem be¬ 
comes as in Figure [3 

max 

Sa=l 


s.t. 

Va G [n] : 

GXi^a + 3 X 2 ,a + 2^3,0 ^ 12 

3 X 2 ,a + 4X3.a + 3X4.a < 12 

2^3,0 + 3X4^a + GXb,a ^ 12 

5 5 


V6 G [4] : 

^ ^ ^b,a — ^ ^ ^b-\-l,a 
a—1 a—1 


Va, 6 G [5] : 

Xb,a > 0 


Fig. 3. 

Simplified integer program. 


n n 

V6 G [n - 1] : ^ Xb,a = ^ Xb+i,a , 

a—1 a—1 


2 


By solving the integer-programming problem in Figure [3 
we obtain that the maximum of the objective is obtained, for 












example, for Xf, Q = 1 for all a,b G [n]. This corresponds to 
the upper bound A{n, d) < 5, which improves on the value 
6 obtained by using the Singleton bound. The actual value of 
A{n, d) in this case is 4. 

We remark, that the proposed integer LP problem can be 
further tightened by using additional constraints. For example, 
one can dehne additional variables X,^bi,ai),ib 2 ,a 2 ),- 
where all ai,bi,t & [^]- Such a variable will count the number 
of permutations a, such that a{bi) = Ui for all i G [f]. 
Additional constraints can be dehned in a manner similar 
to (I3]l, with respect to variables .(bt.a*)- 

IV. Probabilistic bounds 
A. Asymptotic version of the lower bound 

In what follows, we consider an (n, d) Ulam code. Denote 
A = d — 1. Recall the bounds in Proposition IIII.ll By using 


( 771 \ ^ 

—j = exp [m(lnrn — 1)] 


and 

we obtain 
(n-A)! 

~AS~ 


m - 


( 771 \ 

< exp[TOhe(a)] , 


> exp 


(n — A) (ln(n — A) — 1) — nhg 


(4) 

Here he{p), where p G [0,1], is the binary entropy function 
with base e, i.e. he{p) = —pln(p) —(1—p) ln(l—p). Hence, (|4]i 
is an asymptotic lower bound on A{n, d). Consider a special 
case of it, when A = n — Cy/n, c is a constant. Then 1 — = 

cjyfn, and (HI becomes 


exp 

= exp 


yfnc ( - In n + In c — 1 ) — nh^, ( 1- — ) 

y/n)\ 


yfnc (2Inc — 1) 


> exp [i/nc(2Inc — 1) — ci/n] 
= exp [2y/nc ■ (Inc — 1)] . 

Hence, with A„ = n — Cy/n, we have 

{cy/n)\ 




lim inf 


: In 


\ (c^) 


> 2c • (Inc — 1) . 


(5) 


( 6 ) 


Let us now show that 2c • (Inc — 1) is actually the limit. 
Indeed, it holds to! = (1 + o(l)) v/Sttto (for large 

values of to). Then, provided that [n — A) ^ oo. 


In 


(n — A)! 


O 


< {n — A) (ln(n — A) — 1) 


+ - ln(27r(n — A)) + ln(l + o(l)) 


— nhe 


+ ln(n + 1) . 


Hence, since A = n — Cy/n, 
{cy/n)\\ 



^(1_ " )ln(l- " ) 

c y/n y/n 


+ - \n{2'KCy/n) + ln(l + o(l)) + ln(n + 1) . 


Since 


y/n 


In 1 - 


we obtain that 


lim sup —= In 
n y/n 



-1 


< 2c(lnc — 1) . 


(7) 


By combining (|6]l with (ITJ, we have the following result. 

Proposition IV.l. Let A = n — Cy/n, where c is a constant. 
Then, 


lim —= In 
n y/n 


1 ({n- A„)! 


(L) 


= 2c • (Inc — 1) . 


( 8 ) 


B. Bounds using longest increasing subsequence 

The following bounds hold for the Ulam metric (for the 
Kendall r metric see, e.g., El): 

< A{n, d) < 1^ I , (9) 


71 ! 


|6(A)| 


e(f)l ’ 


where |yB(r)| A |{cr : d [/(ejCr) < r}| is the number of 
permutations in the ball centered at the identity e and having 
radius r. The number of permutations in a ball B{r) is difficult 
to estimate. However, under the uniform distribution over all 
permutations in the ratio |S(r)|/n! is just the probability 
that a randomly chosen permutation is at distance at most r 
from e. In terms of the longest increasing subsequences, thus, 

1 ^( 7 ") I 

-— = P(n - U < r) = P(U >n-r), (10) 

n\ 

where L„ is the length of a longest increasing subsequence of 
a random permutation under the uniform distribution. In terms 
of L„, the inequalities (|9]l can be rewritten as 

P{K>n-A) -^(""’A + ljA p(l^>^_a/ 2 )' 

By combining this with (|2l), when A is even, we obtain the 
following probability estimates 


P(U > n 


A/2)< 



(n — A)! 


and P(L„ > n 


A)> 


1 

(n — A)! 


( 11 ) 


The study of the properties of the random variable L„ has a 
long history, starting with the pioneering paper of Ulam E2l . 
where the question of asymptotic behavior of E[L„] was stated. 
This so-called Ulam’s problem deserved attention of many 
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researchers over several decades. In a sense, the problem was 
solved by in the celebrated paper HI, where the limit law of 
(properly centered and scaled) L„ was found. In particular, 
they showed that for every f £ M (as n increases), 

<t] ^F{t) , (12) 

V ne / 

where F{t) is the distribution function of the Tracy-Widom 
law. For a historical overview of Ulam’s problem, the proof 
ofdUli, as well as the state of the art, we refer the reader to the 
book lfT9l . Since the random variable L„ has been studied for 
a relatively long time, one hopes that a proper upper estimate 
on the probability P{Ln > n — A) (or, alternatively, a lower 
estimate on the probability P(L„ > n — A/2)) gives also a 
good lower (upper) bound on A{n, d). 

In what follows, we aim at bounding the probability P(L„ > 
n — A) from above. The following simple estimate can be 
found in ESI page 9]: 

That estimate gives another proof of the lower bound (|2]i. In 
order to improve it, the probability estimate has to be superior 
to EH). When n — 2 > A > 1, then the inequality in (fTSI) is 
strict, and that follows from the use of the Markov inequality 
in the proof. Hence, the lower bound in © is always tighter 
than the bound (|2|i. We have the following result. 


Proposition IV.2. The inequality 
(n — A)! ^ n\ 

holds and for 0 < A < n — 1, the inequality is strict. 

Proof: Apply ESi and E^ . ■ 


Bounds for d = n — c^fn : One of the first probability 
estimates on P(L„ > n — d) was established by Kim EH- 


Thus, for any t £ 


(0,ni/20 


, it holds that 


P — 2^/n >tno^ < exp 


-_t2 


(14) 


where 


fit) = 


t 5 Inn 
— + 


. 


,27n3 t^ni 
That estimate leads to the lower bound on A{d, n) for n — 
Cy/n, where c£ (2,2 + 1/20], which is approximately 

'38-c\ 

27 ) 


exp 


(c-2)t 


\/n 


This is the same order as the bound exp[2Y^c(ln c — 1)], but 
the constant in the expression is smaller. This bound holds 
only for c very close to 2 and above 2. 


The best code rate estimate for large d is given by the 
following large deviation principle El- For every c > 2, 

lim —^ InP (L„ > c-\/n) = —T(c) , (15) 

n -^n 


where 

— J(c) = —2c cosh“^ + 2\/— 4 



In terms of the lower bound, ESi) can be stated as follows. 

Proposition IV.3. For every constant c > 2, the following 
convergence holds: 

lim ^ In f = 


where A„ = n — Cyfn — 1 and —lie) is given in ( 17^ . 

Proof: Use EOl l together with ESl) and ([Sll. Note that ® 
is formally proven for A„ = n — Cy/n, but it also holds for 
A„ = n — Cy/n — 1. ■ 

This proposition yields an asymptotic improvement on the 
lower bound in (|2]i. 

We note that any probability estimate that is better than 
the very simple estimate in ES, gives a better lower bound 
on Ain, d) in comparison with the existing lower bound E). 
Except for large d, there are no better estimates known. 
On the other hand, any good upper bound on |P(n — A)| 
entails also a good estimate on the probability P(L„ > A). 
Since the probabilities P(L„ > A) are closely related to the 
Tracy-Widom distribution, such a link between coding and 
probability theory might be valuable. 

V. Computational results 

In this section, we present computational results related 
to the optimal codes. It turns out that there exist non-trivial 
Ulam-metric codes, which attain the Singleton bound with 
equality. We call such codes Singleton-optimal. Singleton- 
optimal Ulam in, d) codes are also known as perfect deletion- 
correcting codes on n distinct symbols, capable of correcting 
d — 1 deletions and as directed Steiner systems ES). They 
exist for every n and d = 2 ES, and also for n = 6 and 
d = 3 ETI . It was also found by the exhaustive search in ETll 
that ^(7,4) = 12. We have complemented these results for 
other pairs (n, d). 

Table U summarizes what is known about Ain,d). These 
results are obtained by using computer search, and they 
improve on the theoretical bounds in many cases. Table HJ 
summarizes the experimental results on the existence of the 
Singleton-optimal (n, d) Ulam codes. 

In order to obtain these results, we construct the graph on 
the vertex set S„ with an edge if and only if the corresponding 
vertices are at least a distance d away. Our goal is to find a 
clique of the maximum size. 

We assign colors to the vertices of this graph, such that color 
class of a permutation corresponds to the relative ordering 
of symbols l,2,...,n — d+1 in the one-line notation of 


/(c) > 2c(lnc — 1) 




s/n 


(i) 


4 




















d = 2 

d = 3 

d = 4 

d = b 

d = Q 

d = 7 

n = 4 

6 

2 

- 

- 

- 

- 

n = 5 

24 

4 

2 

- 

- 

- 

n. = 6 

120 

24 

4 

2 

- 

- 

n = 7 

720 

> 59 and < 120 

12 

4 

2 

- 

n = 8 

5040 

7 

< 120 

< 12 

4 

2 

n = 9 

40320 

7 

7 

< 120 

< 12 

2 


TABLE I 

Known maximum sizes of codes in the LFlam metric. 


the permutation. The existence of a Singleton-optimal code 
becomes equivalent to the property that the clique number of 
the graph is equal to its chromatic number. Thus, in order 
to obtain a Singleton-optimal code, we need to pick exactly 
one vertex from each color class, such that the induced graph 
forms a clique. 

To obtain a maximum-size code when a Singleton-optimal 
code does not exist, we need to pick at most one vertex from 
each color class. This makes the respective exhaustive search 
computationally much harder. 
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